Source controlled variable bit-rate speech coder based on waveform interpolation

Authors

  • Fabrice Plante
  • Barry M. G. Cheetham
  • David F. Marston
  • P. A. Barrett
Abstract

This paper describes a source-controlled variable bit-rate (SC-VBR) speech coder based on the concept of prototype waveform interpolation (PWI). The coder uses a four-mode classification: silence, voiced, unvoiced and transition. These modes are detected after the speech has been decomposed into slowly evolving waveforms (SEW) and rapidly evolving waveforms (REW). The classification uses voice activity detection (VAD), the relative level of the SEW and REW, and the cross-correlation coefficient between characteristic waveform segments. The encoding of the SEW components is improved using gender adaptation. In tests using conversational speech, the SC-VBR coder allows a compression factor of around 3. The VBR coder was evaluated against a fixed-rate 4.6 kbit/s PWI coder for clean speech and noisy speech and was found to perform better for male speech and for noisy speech.
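The four-mode decision described in the abstract can be illustrated with a minimal sketch. The abstract names the inputs (a VAD decision, the relative SEW/REW level, and a cross-correlation coefficient between characteristic waveform segments) but not the decision rules, so everything below is an assumption made for illustration: the names `FrameFeatures`, `normalized_correlation` and `classify_frame`, both threshold values, and the rule ordering are hypothetical and are not taken from the paper.

```python
import math
from dataclasses import dataclass


@dataclass
class FrameFeatures:
    """Per-frame inputs named in the abstract (field names are hypothetical)."""
    vad_active: bool       # VAD decision for the frame
    sew_energy: float      # level of the slowly evolving waveform
    rew_energy: float      # level of the rapidly evolving waveform
    cw_correlation: float  # cross-correlation between characteristic waveform segments


def normalized_correlation(a, b):
    """Normalized cross-correlation coefficient of two equal-length segments."""
    num = sum(x * y for x, y in zip(a, b))
    den = math.sqrt(sum(x * x for x in a) * sum(y * y for y in b))
    return num / den if den > 0.0 else 0.0


def classify_frame(f, ratio_threshold=1.0, corr_threshold=0.6):
    """Assign one of the four modes; thresholds and rules are assumed, not from the paper."""
    if not f.vad_active:
        return "silence"
    # Assumed rule: SEW dominates when its level exceeds the scaled REW level.
    sew_dominant = f.sew_energy > ratio_threshold * f.rew_energy
    if sew_dominant and f.cw_correlation >= corr_threshold:
        return "voiced"
    if not sew_dominant and f.cw_correlation < corr_threshold:
        return "unvoiced"
    # Mixed evidence is treated as a transition frame.
    return "transition"
```

For example, `classify_frame(FrameFeatures(True, 1.0, 0.1, 0.9))` yields `"voiced"` under these assumed thresholds, while an inactive VAD decision always yields `"silence"` regardless of the other features.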


Similar resources

A Closed-loop Multimode Variable Bit Rate Characteristic Waveform Interpolation Coder

A variable bit rate characteristic waveform interpolation (VBR-CWI) speech codec with an average bit rate of about 1.86 kbps, which combines closed-loop multimode techniques, is presented in this paper. Each kind of characteristic waveform (CW) surface is regarded as only rapidly evolving waveforms (REWs), only slowly evolving waveforms (SEWs) or mixed REWs plus SEWs in different cases of CWs evolving p...


A new low bit rate speech coder based on intraframe waveform interpolation

A new characteristic waveform (CW) interpolation coder is proposed in this paper. In the proposed coder, two characteristic waveforms are extracted from the LPC residual signal in each frame. Waveform interpolation (WI) is performed within the frame. In the novel WI, variable dimension vector quantization (VDVQ) and power vector quantization are proposed and the low frequency band (LFB) and high...


A Low-complexity Improved WI Speech Coding at 2kbps

Waveform interpolation (WI) speech coding performs well at low bit rates. However, the algorithm has very high computational complexity. In this paper, a low-complexity improved waveform interpolation speech coder at 2 kbps is proposed. The improved coding scheme greatly reduces the computational complexity and improves the reconstructed speech quality by using various te...


A 1.7KBPS waveform interpolation speech coder using decomposition of pitch cycle waveform

In this paper, we propose a low bit rate waveform interpolation speech coder where the novelty lies with an effective decomposition method of pitch cycle waveform (PCW). PCWs exhibit very different perceptual characteristics in different frequency bands. For frequency components below 1 kHz, they are quantized using Variable Dimensional Vector Quantization (VDVQ) scheme. Hereby retaining the fine harmo...


A Flexible Multirate Speech Coder

This paper describes algorithms that provide the capability of parametrically coding speech using Sinusoidal Transform Coding (STC) at a variety of bit rates and of transforming the coded bit stream to a lower-rate bit stream without interaction with the source, through a set of techniques called parameter space transformations. Parameter space transformations are a generalization...




Publication date: 1998